Dataset statistics
| Number of variables | 26 |
|---|---|
| Number of observations | 96844 |
| Missing cells | 162203 |
| Missing cells (%) | 6.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 19.2 MiB |
| Average record size in memory | 208.0 B |
Variable types
| Text | 10 |
|---|---|
| Unsupported | 9 |
| DateTime | 1 |
| Categorical | 5 |
| Numeric | 1 |
Tipo_restos is highly imbalanced (82.7%) | Imbalance |
Conocido_desconocido is highly imbalanced (55.8%) | Imbalance |
Primer_apellido has 1878 (1.9%) missing values | Missing |
Nombres_propios has 2095 (2.2%) missing values | Missing |
Procedencia_alcaldia has 31946 (33.0%) missing values | Missing |
Procedencia_acta has 25635 (26.5%) missing values | Missing |
Diagnostico_estandar has 8267 (8.5%) missing values | Missing |
Diagnostico_extendido has 8267 (8.5%) missing values | Missing |
Observaciones has 82723 (85.4%) missing values | Missing |
ID has unique values | Unique |
Numero_progresivo_transcrito is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Fecha_transcrito is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Expediente_SEMEFO_transcrito is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Procedencia_transcrito is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Numero_acta_transcrito is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Procedencia_acta is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Edad_transcrito is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Foja_transcrito is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Observaciones is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
| Analysis started | 2025-02-12 02:01:12.334692 |
|---|---|
| Analysis finished | 2025-02-12 02:01:19.037178 |
| Duration | 6.7 seconds |
| Software version | ydata-profiling v4.8.3 |
| Download configuration | config.json |
ID
Text
UNIQUE 
| Distinct | 96844 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 756.7 KiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Characters and Unicode
| Total characters | 1258972 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 96844 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | BO_1968_00001 |
|---|---|
| 2nd row | BO_1968_00002 |
| 3rd row | BO_1968_00003 |
| 4th row | BO_1968_00004 |
| 5th row | BO_1968_00005 |
| Value | Count | Frequency (%) |
| bo_1968_00001 | 1 | < 0.1% |
| bo_1968_00015 | 1 | < 0.1% |
| bo_1968_00006 | 1 | < 0.1% |
| bo_1968_00007 | 1 | < 0.1% |
| bo_1968_00008 | 1 | < 0.1% |
| bo_1968_00009 | 1 | < 0.1% |
| bo_1968_00010 | 1 | < 0.1% |
| bo_1968_00011 | 1 | < 0.1% |
| bo_1968_00046 | 1 | < 0.1% |
| bo_1968_00012 | 1 | < 0.1% |
| Other values (96834) | 96834 |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 193688 | |
| 1 | 154647 | |
| 0 | 154514 | |
| 9 | 137570 | |
| 7 | 102548 | |
| B | 96844 | |
| O | 96844 | |
| 8 | 62974 | 5.0% |
| 2 | 57774 | 4.6% |
| 6 | 53714 | 4.3% |
| Other values (3) | 147855 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 871596 | |
| Connector Punctuation | 193688 | 15.4% |
| Uppercase Letter | 193688 | 15.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 154647 | |
| 0 | 154514 | |
| 9 | 137570 | |
| 7 | 102548 | |
| 8 | 62974 | |
| 2 | 57774 | 6.6% |
| 6 | 53714 | 6.2% |
| 3 | 50864 | 5.8% |
| 4 | 49755 | 5.7% |
| 5 | 47236 | 5.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 96844 | |
| O | 96844 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 193688 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1065284 | |
| Latin | 193688 | 15.4% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| _ | 193688 | |
| 1 | 154647 | |
| 0 | 154514 | |
| 9 | 137570 | |
| 7 | 102548 | |
| 8 | 62974 | 5.9% |
| 2 | 57774 | 5.4% |
| 6 | 53714 | 5.0% |
| 3 | 50864 | 4.8% |
| 4 | 49755 | 4.7% |
Latin
| Value | Count | Frequency (%) |
| B | 96844 | |
| O | 96844 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1258972 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 193688 | |
| 1 | 154647 | |
| 0 | 154514 | |
| 9 | 137570 | |
| 7 | 102548 | |
| B | 96844 | |
| O | 96844 | |
| 8 | 62974 | 5.0% |
| 2 | 57774 | 4.6% |
| 6 | 53714 | 4.3% |
| Other values (3) | 147855 |
Numero_progresivo_transcrito
Unsupported
REJECTED  UNSUPPORTED 
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 756.7 KiB |
| Distinct | 75516 |
|---|---|
| Distinct (%) | 78.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 756.7 KiB |
Length
| Max length | 132 |
|---|---|
| Median length | 48 |
| Mean length | 22.59474 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2188165 |
|---|---|
| Distinct characters | 61 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 73906 ? |
|---|---|
| Unique (%) | 76.3% |
Sample
| 1st row | acosta ortega teresa |
|---|---|
| 2nd row | avila de cuestas catalina |
| 3rd row | arzate paredes juan |
| 4th row | alvarez martinez isaac |
| 5th row | arellano viuda de campos ma. |
| Value | Count | Frequency (%) |
| desconocido | 13187 | 4.2% |
| de | 9468 | 3.0% |
| hernandez | 5847 | 1.9% |
| garcia | 4946 | 1.6% |
| martinez | 4625 | 1.5% |
| con | 4320 | 1.4% |
| jose | 4266 | 1.4% |
| gonzalez | 3775 | 1.2% |
| lopez | 3316 | 1.1% |
| sanchez | 3077 | 1.0% |
| Other values (14220) | 257038 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 247825 | |
| 217022 | 9.9% | |
| e | 213704 | 9.8% |
| o | 202460 | 9.3% |
| r | 168328 | 7.7% |
| n | 140674 | 6.4% |
| i | 132017 | 6.0% |
| c | 103482 | 4.7% |
| l | 102371 | 4.7% |
| d | 97298 | 4.4% |
| Other values (51) | 562984 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1935397 | |
| Space Separator | 217022 | 9.9% |
| Uppercase Letter | 17266 | 0.8% |
| Other Punctuation | 11718 | 0.5% |
| Decimal Number | 6511 | 0.3% |
| Open Punctuation | 102 | < 0.1% |
| Close Punctuation | 100 | < 0.1% |
| Dash Punctuation | 48 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 247825 | |
| e | 213704 | |
| o | 202460 | |
| r | 168328 | 8.7% |
| n | 140674 | 7.3% |
| i | 132017 | 6.8% |
| c | 103482 | 5.3% |
| l | 102371 | 5.3% |
| d | 97298 | 5.0% |
| s | 92146 | 4.8% |
| Other values (18) | 435092 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 2158 | |
| P | 2158 | |
| A | 2158 | |
| G | 2158 | |
| T | 2158 | |
| L | 2158 | |
| S | 2158 | |
| N | 2158 | |
| E | 1 | < 0.1% |
| B | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 10307 | |
| " | 1031 | 8.8% |
| ' | 333 | 2.8% |
| : | 23 | 0.2% |
| , | 10 | 0.1% |
| / | 5 | < 0.1% |
| ? | 4 | < 0.1% |
| # | 4 | < 0.1% |
| * | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4322 | |
| 6 | 2158 | |
| 2 | 14 | 0.2% |
| 3 | 11 | 0.2% |
| 4 | 3 | < 0.1% |
| 5 | 2 | < 0.1% |
| 0 | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 89 | |
| ] | 11 | 11.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 89 | |
| [ | 13 | 12.7% |
Space Separator
| Value | Count | Frequency (%) |
| 217022 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 48 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1952663 | |
| Common | 235502 | 10.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 247825 | |
| e | 213704 | |
| o | 202460 | |
| r | 168328 | 8.6% |
| n | 140674 | 7.2% |
| i | 132017 | 6.8% |
| c | 103482 | 5.3% |
| l | 102371 | 5.2% |
| d | 97298 | 5.0% |
| s | 92146 | 4.7% |
| Other values (28) | 452358 |
Common
| Value | Count | Frequency (%) |
| 217022 | ||
| . | 10307 | 4.4% |
| 1 | 4322 | 1.8% |
| 6 | 2158 | 0.9% |
| " | 1031 | 0.4% |
| ' | 333 | 0.1% |
| ) | 89 | < 0.1% |
| ( | 89 | < 0.1% |
| - | 48 | < 0.1% |
| : | 23 | < 0.1% |
| Other values (13) | 80 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2183846 | |
| None | 4319 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 247825 | |
| 217022 | 9.9% | |
| e | 213704 | 9.8% |
| o | 202460 | 9.3% |
| r | 168328 | 7.7% |
| n | 140674 | 6.4% |
| i | 132017 | 6.0% |
| c | 103482 | 4.7% |
| l | 102371 | 4.7% |
| d | 97298 | 4.5% |
| Other values (49) | 558665 |
None
| Value | Count | Frequency (%) |
| í | 4316 | |
| ñ | 3 | 0.1% |
Primer_apellido
Text
MISSING 
| Distinct | 7797 |
|---|---|
| Distinct (%) | 8.2% |
| Missing | 1878 |
| Missing (%) | 1.9% |
| Memory size | 756.7 KiB |
Length
| Max length | 29 |
|---|---|
| Median length | 26 |
| Mean length | 6.0098983 |
| Min length | 1 |
Characters and Unicode
| Total characters | 570736 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 5370 ? |
|---|---|
| Unique (%) | 5.7% |
Sample
| 1st row | acosta |
|---|---|
| 2nd row | avila |
| 3rd row | arzate |
| 4th row | alvarez |
| 5th row | arellano |
| Value | Count | Frequency (%) |
| s-d | 18107 | 18.5% |
| hernandez | 2961 | 3.0% |
| garcia | 2577 | 2.6% |
| martinez | 2353 | 2.4% |
| gonzalez | 1934 | 2.0% |
| lopez | 1694 | 1.7% |
| sanchez | 1583 | 1.6% |
| rodriguez | 1480 | 1.5% |
| perez | 1407 | 1.4% |
| ramirez | 1402 | 1.4% |
| Other values (7084) | 62188 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 75980 | |
| e | 60221 | |
| r | 54691 | 9.6% |
| s | 41022 | 7.2% |
| o | 40128 | 7.0% |
| d | 35235 | 6.2% |
| n | 34549 | 6.1% |
| z | 32630 | 5.7% |
| i | 28647 | 5.0% |
| l | 28102 | 4.9% |
| Other values (26) | 139531 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 549554 | |
| Dash Punctuation | 18109 | 3.2% |
| Space Separator | 2728 | 0.5% |
| Other Punctuation | 338 | 0.1% |
| Open Punctuation | 3 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
| Decimal Number | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 75980 | |
| e | 60221 | |
| r | 54691 | |
| s | 41022 | 7.5% |
| o | 40128 | 7.3% |
| d | 35235 | 6.4% |
| n | 34549 | 6.3% |
| z | 32630 | 5.9% |
| i | 28647 | 5.2% |
| l | 28102 | 5.1% |
| Other values (17) | 118349 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 217 | |
| " | 118 | |
| ' | 3 | 0.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 18109 |
Space Separator
| Value | Count | Frequency (%) |
| 2728 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 2 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 549554 | |
| Common | 21182 | 3.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 75980 | |
| e | 60221 | |
| r | 54691 | |
| s | 41022 | 7.5% |
| o | 40128 | 7.3% |
| d | 35235 | 6.4% |
| n | 34549 | 6.3% |
| z | 32630 | 5.9% |
| i | 28647 | 5.2% |
| l | 28102 | 5.1% |
| Other values (17) | 118349 |
Common
| Value | Count | Frequency (%) |
| - | 18109 | |
| 2728 | 12.9% | |
| . | 217 | 1.0% |
| " | 118 | 0.6% |
| [ | 3 | < 0.1% |
| ' | 3 | < 0.1% |
| ] | 2 | < 0.1% |
| + | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 570735 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 75980 | |
| e | 60221 | |
| r | 54691 | 9.6% |
| s | 41022 | 7.2% |
| o | 40128 | 7.0% |
| d | 35235 | 6.2% |
| n | 34549 | 6.1% |
| z | 32630 | 5.7% |
| i | 28647 | 5.0% |
| l | 28102 | 4.9% |
| Other values (25) | 139530 |
None
| Value | Count | Frequency (%) |
| ñ | 1 |
Segundo_apellido
Text
| Distinct | 8317 |
|---|---|
| Distinct (%) | 8.6% |
| Missing | 281 |
| Missing (%) | 0.3% |
| Memory size | 756.7 KiB |
Length
| Max length | 25 |
|---|---|
| Median length | 23 |
| Mean length | 5.9092717 |
| Min length | 1 |
Characters and Unicode
| Total characters | 570617 |
|---|---|
| Distinct characters | 39 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 5633 ? |
|---|---|
| Unique (%) | 5.8% |
Sample
| 1st row | ortega |
|---|---|
| 2nd row | de cuestas |
| 3rd row | paredes |
| 4th row | martinez |
| 5th row | viuda de campos |
| Value | Count | Frequency (%) |
| s-d | 21916 | 21.6% |
| de | 3109 | 3.1% |
| hernandez | 2881 | 2.8% |
| garcia | 2366 | 2.3% |
| martinez | 2263 | 2.2% |
| gonzalez | 1839 | 1.8% |
| lopez | 1613 | 1.6% |
| sanchez | 1495 | 1.5% |
| rodriguez | 1398 | 1.4% |
| perez | 1344 | 1.3% |
| Other values (6954) | 61156 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 73198 | |
| e | 59670 | |
| r | 52015 | 9.1% |
| s | 43698 | 7.7% |
| d | 40930 | 7.2% |
| o | 38814 | 6.8% |
| n | 33651 | 5.9% |
| z | 31111 | 5.5% |
| i | 27957 | 4.9% |
| l | 26826 | 4.7% |
| Other values (29) | 142747 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 542311 | |
| Dash Punctuation | 21917 | 3.8% |
| Space Separator | 4817 | 0.8% |
| Other Punctuation | 1555 | 0.3% |
| Close Punctuation | 8 | < 0.1% |
| Open Punctuation | 8 | < 0.1% |
| Decimal Number | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 73198 | |
| e | 59670 | |
| r | 52015 | |
| s | 43698 | 8.1% |
| d | 40930 | 7.5% |
| o | 38814 | 7.2% |
| n | 33651 | 6.2% |
| z | 31111 | 5.7% |
| i | 27957 | 5.2% |
| l | 26826 | 4.9% |
| Other values (17) | 114441 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 817 | |
| " | 542 | |
| ' | 194 | 12.5% |
| , | 1 | 0.1% |
| ? | 1 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 4 | |
| ) | 4 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4 | |
| [ | 4 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 21917 |
Space Separator
| Value | Count | Frequency (%) |
| 4817 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 542311 | |
| Common | 28306 | 5.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 73198 | |
| e | 59670 | |
| r | 52015 | |
| s | 43698 | 8.1% |
| d | 40930 | 7.5% |
| o | 38814 | 7.2% |
| n | 33651 | 6.2% |
| z | 31111 | 5.7% |
| i | 27957 | 5.2% |
| l | 26826 | 4.9% |
| Other values (17) | 114441 |
Common
| Value | Count | Frequency (%) |
| - | 21917 | |
| 4817 | 17.0% | |
| . | 817 | 2.9% |
| " | 542 | 1.9% |
| ' | 194 | 0.7% |
| ] | 4 | < 0.1% |
| ( | 4 | < 0.1% |
| ) | 4 | < 0.1% |
| [ | 4 | < 0.1% |
| 0 | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 570615 | |
| None | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 73198 | |
| e | 59670 | |
| r | 52015 | 9.1% |
| s | 43698 | 7.7% |
| d | 40930 | 7.2% |
| o | 38814 | 6.8% |
| n | 33651 | 5.9% |
| z | 31111 | 5.5% |
| i | 27957 | 4.9% |
| l | 26826 | 4.7% |
| Other values (28) | 142745 |
None
| Value | Count | Frequency (%) |
| ñ | 2 |
Nombres_propios
Text
MISSING 
| Distinct | 7777 |
|---|---|
| Distinct (%) | 8.2% |
| Missing | 2095 |
| Missing (%) | 2.2% |
| Memory size | 756.7 KiB |
Length
| Max length | 55 |
|---|---|
| Median length | 30 |
| Mean length | 6.3480037 |
| Min length | 1 |
Characters and Unicode
| Total characters | 601467 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5504 ? |
|---|---|
| Unique (%) | 5.8% |
Sample
| 1st row | teresa |
|---|---|
| 2nd row | catalina |
| 3rd row | juan |
| 4th row | isaac |
| 5th row | ma. |
| Value | Count | Frequency (%) |
| s-d | 18138 | 16.8% |
| jose | 4193 | 3.9% |
| maria | 2661 | 2.5% |
| juan | 2414 | 2.2% |
| luis | 2160 | 2.0% |
| jesus | 1780 | 1.7% |
| antonio | 1757 | 1.6% |
| francisco | 1667 | 1.5% |
| manuel | 1518 | 1.4% |
| j | 1497 | 1.4% |
| Other values (3884) | 70047 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 72452 | |
| o | 55652 | 9.3% |
| e | 51934 | 8.6% |
| i | 46914 | 7.8% |
| s | 46171 | 7.7% |
| r | 45752 | 7.6% |
| n | 37381 | 6.2% |
| d | 36321 | 6.0% |
| l | 34313 | 5.7% |
| u | 23464 | 3.9% |
| Other values (36) | 151113 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 565653 | |
| Dash Punctuation | 18142 | 3.0% |
| Space Separator | 13088 | 2.2% |
| Other Punctuation | 4550 | 0.8% |
| Close Punctuation | 13 | < 0.1% |
| Open Punctuation | 13 | < 0.1% |
| Decimal Number | 6 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 72452 | |
| o | 55652 | |
| e | 51934 | |
| i | 46914 | |
| s | 46171 | |
| r | 45752 | 8.1% |
| n | 37381 | 6.6% |
| d | 36321 | 6.4% |
| l | 34313 | 6.1% |
| u | 23464 | 4.1% |
| Other values (16) | 115299 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4361 | |
| " | 170 | 3.7% |
| ' | 9 | 0.2% |
| , | 6 | 0.1% |
| / | 2 | < 0.1% |
| * | 1 | < 0.1% |
| : | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 2 | |
| 2 | 1 | |
| 1 | 1 | |
| 7 | 1 | |
| 8 | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 8 | |
| ] | 5 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 8 | |
| [ | 5 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1 | |
| B | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 18142 |
Space Separator
| Value | Count | Frequency (%) |
| 13088 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 565655 | |
| Common | 35812 | 6.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 72452 | |
| o | 55652 | |
| e | 51934 | |
| i | 46914 | |
| s | 46171 | |
| r | 45752 | 8.1% |
| n | 37381 | 6.6% |
| d | 36321 | 6.4% |
| l | 34313 | 6.1% |
| u | 23464 | 4.1% |
| Other values (18) | 115301 |
Common
| Value | Count | Frequency (%) |
| - | 18142 | |
| 13088 | ||
| . | 4361 | 12.2% |
| " | 170 | 0.5% |
| ' | 9 | < 0.1% |
| ) | 8 | < 0.1% |
| ( | 8 | < 0.1% |
| , | 6 | < 0.1% |
| ] | 5 | < 0.1% |
| [ | 5 | < 0.1% |
| Other values (8) | 10 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 601467 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 72452 | |
| o | 55652 | 9.3% |
| e | 51934 | 8.6% |
| i | 46914 | 7.8% |
| s | 46171 | 7.7% |
| r | 45752 | 7.6% |
| n | 37381 | 6.2% |
| d | 36321 | 6.0% |
| l | 34313 | 5.7% |
| u | 23464 | 3.9% |
| Other values (36) | 151113 |
Fecha_transcrito
Unsupported
REJECTED  UNSUPPORTED 
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 756.7 KiB |
Fecha_estandar
Date
| Distinct | 5479 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 266 |
| Missing (%) | 0.3% |
| Memory size | 756.7 KiB |
| Minimum | 1968-01-01 00:00:00 |
|---|---|
| Maximum | 1982-12-31 00:00:00 |
Expediente_SEMEFO_transcrito
Unsupported
REJECTED  UNSUPPORTED 
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 756.7 KiB |
Procedencia_transcrito
Unsupported
REJECTED  UNSUPPORTED 
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 756.7 KiB |
| Distinct | 109 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 408 |
| Missing (%) | 0.4% |
| Memory size | 756.7 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 2.9963499 |
| Min length | 2 |
Characters and Unicode
| Total characters | 288956 |
|---|---|
| Distinct characters | 40 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | S-D |
|---|---|
| 2nd row | S-D |
| 3rd row | S-D |
| 4th row | S-D |
| 5th row | S-D |
| Value | Count | Frequency (%) |
| s-d | 26833 | |
| 32a | 3635 | 3.8% |
| 33a | 3054 | 3.2% |
| 37a | 2812 | 2.9% |
| 1a | 2445 | 2.5% |
| htb | 2294 | 2.4% |
| 20a | 2254 | 2.3% |
| 13a | 2136 | 2.2% |
| 35a | 2063 | 2.1% |
| 36a | 1936 | 2.0% |
| Other values (101) | 46978 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 52512 | |
| - | 37674 | |
| S | 26861 | |
| D | 26835 | |
| 3 | 23861 | |
| 1 | 21340 | 7.4% |
| 2 | 19625 | 6.8% |
| H | 11333 | 3.9% |
| C | 7784 | 2.7% |
| 4 | 6249 | 2.2% |
| Other values (30) | 54882 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 155504 | |
| Decimal Number | 95746 | |
| Dash Punctuation | 37674 | 13.0% |
| Lowercase Letter | 28 | < 0.1% |
| Space Separator | 4 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 52512 | |
| S | 26861 | |
| D | 26835 | |
| H | 11333 | 7.3% |
| C | 7784 | 5.0% |
| M | 5014 | 3.2% |
| T | 4386 | 2.8% |
| B | 3055 | 2.0% |
| V | 3046 | 2.0% |
| R | 2879 | 1.9% |
| Other values (9) | 11799 | 7.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 23861 | |
| 1 | 21340 | |
| 2 | 19625 | |
| 4 | 6249 | 6.5% |
| 7 | 5566 | 5.8% |
| 5 | 5137 | 5.4% |
| 6 | 4440 | 4.6% |
| 8 | 3284 | 3.4% |
| 0 | 3207 | 3.3% |
| 9 | 3037 | 3.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 6 | |
| a | 6 | |
| p | 4 | |
| r | 2 | 7.1% |
| u | 2 | 7.1% |
| z | 2 | 7.1% |
| t | 2 | 7.1% |
| c | 2 | 7.1% |
| l | 2 | 7.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 37674 |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 155532 | |
| Common | 133424 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 52512 | |
| S | 26861 | |
| D | 26835 | |
| H | 11333 | 7.3% |
| C | 7784 | 5.0% |
| M | 5014 | 3.2% |
| T | 4386 | 2.8% |
| B | 3055 | 2.0% |
| V | 3046 | 2.0% |
| R | 2879 | 1.9% |
| Other values (18) | 11827 | 7.6% |
Common
| Value | Count | Frequency (%) |
| - | 37674 | |
| 3 | 23861 | |
| 1 | 21340 | |
| 2 | 19625 | |
| 4 | 6249 | 4.7% |
| 7 | 5566 | 4.2% |
| 5 | 5137 | 3.9% |
| 6 | 4440 | 3.3% |
| 8 | 3284 | 2.5% |
| 0 | 3207 | 2.4% |
| Other values (2) | 3041 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 288956 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 52512 | |
| - | 37674 | |
| S | 26861 | |
| D | 26835 | |
| 3 | 23861 | |
| 1 | 21340 | 7.4% |
| 2 | 19625 | 6.8% |
| H | 11333 | 3.9% |
| C | 7784 | 2.7% |
| 4 | 6249 | 2.2% |
| Other values (30) | 54882 |
| Distinct | 97 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 437 |
| Missing (%) | 0.5% |
| Memory size | 756.7 KiB |
Length
| Max length | 95 |
|---|---|
| Median length | 74 |
| Mean length | 32.238634 |
| Min length | 2 |
Characters and Unicode
| Total characters | 3108030 |
|---|---|
| Distinct characters | 64 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Sin datos |
|---|---|
| 2nd row | Sin datos |
| 3rd row | Sin datos |
| 4th row | Sin datos |
| 5th row | Sin datos |
| Value | Count | Frequency (%) |
| col | 60703 | 12.9% |
| ministerio | 48754 | 10.4% |
| público | 48754 | 10.4% |
| sin | 26835 | 5.7% |
| datos | 26835 | 5.7% |
| balbuena | 9989 | 2.1% |
| territorial | 9625 | 2.0% |
| coordinación | 9625 | 2.0% |
| 1 | 8111 | 1.7% |
| hospital | 7871 | 1.7% |
| Other values (149) | 213930 |
Most occurring characters
| Value | Count | Frequency (%) |
| 374625 | 12.1% | |
| i | 298705 | 9.6% |
| o | 297107 | 9.6% |
| a | 204152 | 6.6% |
| l | 182274 | 5.9% |
| n | 172098 | 5.5% |
| r | 170709 | 5.5% |
| e | 143069 | 4.6% |
| t | 124593 | 4.0% |
| s | 108133 | 3.5% |
| Other values (54) | 1032565 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2126900 | |
| Space Separator | 374625 | 12.1% |
| Uppercase Letter | 373994 | 12.0% |
| Decimal Number | 99975 | 3.2% |
| Close Punctuation | 64494 | 2.1% |
| Open Punctuation | 64494 | 2.1% |
| Other Punctuation | 2759 | 0.1% |
| Other Letter | 789 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 298705 | |
| o | 297107 | |
| a | 204152 | |
| l | 182274 | |
| n | 172098 | |
| r | 170709 | |
| e | 143069 | |
| t | 124593 | 5.9% |
| s | 108133 | 5.1% |
| c | 88472 | 4.2% |
| Other values (17) | 337588 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 88711 | |
| M | 64456 | |
| P | 56524 | |
| S | 40822 | |
| T | 22677 | 6.1% |
| B | 16559 | 4.4% |
| A | 12383 | 3.3% |
| G | 11974 | 3.2% |
| H | 11955 | 3.2% |
| V | 10642 | 2.8% |
| Other values (12) | 37291 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 23859 | |
| 2 | 22141 | |
| 1 | 21337 | |
| 4 | 6246 | 6.2% |
| 7 | 5566 | 5.6% |
| 5 | 5133 | 5.1% |
| 0 | 4932 | 4.9% |
| 6 | 4440 | 4.4% |
| 8 | 3284 | 3.3% |
| 9 | 3037 | 3.0% |
Space Separator
| Value | Count | Frequency (%) |
| 374625 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 64494 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 64494 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2759 |
Other Letter
| Value | Count | Frequency (%) |
| ª | 789 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2501683 | |
| Common | 606347 | 19.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 298705 | |
| o | 297107 | |
| a | 204152 | 8.2% |
| l | 182274 | 7.3% |
| n | 172098 | 6.9% |
| r | 170709 | 6.8% |
| e | 143069 | 5.7% |
| t | 124593 | 5.0% |
| s | 108133 | 4.3% |
| C | 88711 | 3.5% |
| Other values (40) | 712132 |
Common
| Value | Count | Frequency (%) |
| 374625 | ||
| ) | 64494 | 10.6% |
| ( | 64494 | 10.6% |
| 3 | 23859 | 3.9% |
| 2 | 22141 | 3.7% |
| 1 | 21337 | 3.5% |
| 4 | 6246 | 1.0% |
| 7 | 5566 | 0.9% |
| 5 | 5133 | 0.8% |
| 0 | 4932 | 0.8% |
| Other values (4) | 13520 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3019464 | |
| None | 88566 | 2.8% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 374625 | 12.4% | |
| i | 298705 | 9.9% |
| o | 297107 | 9.8% |
| a | 204152 | 6.8% |
| l | 182274 | 6.0% |
| n | 172098 | 5.7% |
| r | 170709 | 5.7% |
| e | 143069 | 4.7% |
| t | 124593 | 4.1% |
| s | 108133 | 3.6% |
| Other values (46) | 943999 |
None
| Value | Count | Frequency (%) |
| ú | 48760 | |
| ó | 16792 | 19.0% |
| á | 8391 | 9.5% |
| í | 6582 | 7.4% |
| é | 5364 | 6.1% |
| ñ | 1295 | 1.5% |
| ª | 789 | 0.9% |
| Á | 593 | 0.7% |
Procedencia_alcaldia
Categorical
MISSING 
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 31946 |
| Missing (%) | 33.0% |
| Memory size | 756.7 KiB |
| Cuauhtémoc | |
|---|---|
| Miguel Hidalgo | |
| Gustavo A. Madero | |
| Benito Juárez | |
| Venustiano Carranza | |
| Other values (11) |
Length
| Max length | 19 |
|---|---|
| Median length | 17 |
| Mean length | 13.087876 |
| Min length | 7 |
Characters and Unicode
| Total characters | 849377 |
|---|---|
| Distinct characters | 39 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Cuauhtémoc |
|---|---|
| 2nd row | Cuauhtémoc |
| 3rd row | Benito Juárez |
| 4th row | Coyoacán |
| 5th row | Cuauhtémoc |
Common Values
| Value | Count | Frequency (%) |
| Cuauhtémoc | 10686 | 11.0% |
| Miguel Hidalgo | 10494 | 10.8% |
| Gustavo A. Madero | 9012 | 9.3% |
| Benito Juárez | 8647 | 8.9% |
| Venustiano Carranza | 7580 | 7.8% |
| Iztacalco | 4232 | 4.4% |
| Coyoacán | 2638 | 2.7% |
| Álvaro Obregón | 2412 | 2.5% |
| Iztapalapa | 2356 | 2.4% |
| Azcapotzalco | 2203 | 2.3% |
| Other values (6) | 4638 | 4.8% |
| (Missing) | 31946 |
Length
| Value | Count | Frequency (%) |
| cuauhtémoc | 10686 | |
| hidalgo | 10494 | |
| miguel | 10494 | |
| gustavo | 9012 | |
| a | 9012 | |
| madero | 9012 | |
| benito | 8647 | |
| juárez | 8647 | |
| venustiano | 7580 | 6.7% |
| carranza | 7580 | 6.7% |
| Other values (14) | 21511 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 104873 | 12.3% |
| o | 74416 | 8.8% |
| u | 58358 | 6.9% |
| 47777 | 5.6% | |
| e | 47526 | 5.6% |
| t | 45336 | 5.3% |
| i | 40530 | 4.8% |
| n | 38790 | 4.6% |
| l | 38701 | 4.6% |
| r | 38010 | 4.5% |
| Other values (29) | 315060 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 679913 | |
| Uppercase Letter | 112675 | 13.3% |
| Space Separator | 47777 | 5.6% |
| Other Punctuation | 9012 | 1.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 104873 | |
| o | 74416 | |
| u | 58358 | 8.6% |
| e | 47526 | 7.0% |
| t | 45336 | 6.7% |
| i | 40530 | 6.0% |
| n | 38790 | 5.7% |
| l | 38701 | 5.7% |
| r | 38010 | 5.6% |
| c | 28969 | 4.3% |
| Other values (14) | 164404 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 22041 | |
| M | 20126 | |
| A | 11468 | |
| H | 10494 | |
| G | 9012 | |
| J | 8647 | 7.7% |
| B | 8647 | 7.7% |
| V | 7580 | 6.7% |
| I | 6588 | 5.8% |
| O | 2412 | 2.1% |
| Other values (3) | 5660 | 5.0% |
Space Separator
| Value | Count | Frequency (%) |
| 47777 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 9012 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 792588 | |
| Common | 56789 | 6.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 104873 | 13.2% |
| o | 74416 | 9.4% |
| u | 58358 | 7.4% |
| e | 47526 | 6.0% |
| t | 45336 | 5.7% |
| i | 40530 | 5.1% |
| n | 38790 | 4.9% |
| l | 38701 | 4.9% |
| r | 38010 | 4.8% |
| c | 28969 | 3.7% |
| Other values (27) | 277079 |
Common
| Value | Count | Frequency (%) |
| 47777 | ||
| . | 9012 | 15.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 822099 | |
| None | 27278 | 3.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 104873 | 12.8% |
| o | 74416 | 9.1% |
| u | 58358 | 7.1% |
| 47777 | 5.8% | |
| e | 47526 | 5.8% |
| t | 45336 | 5.5% |
| i | 40530 | 4.9% |
| n | 38790 | 4.7% |
| l | 38701 | 4.7% |
| r | 38010 | 4.6% |
| Other values (25) | 287782 |
None
| Value | Count | Frequency (%) |
| á | 11768 | |
| é | 10686 | |
| ó | 2412 | 8.8% |
| Á | 2412 | 8.8% |
Numero_acta_transcrito
Unsupported
REJECTED  UNSUPPORTED 
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 756.7 KiB |
Procedencia_acta
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 25635 |
|---|---|
| Missing (%) | 26.5% |
| Memory size | 756.7 KiB |
| Distinct | 3709 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 756.7 KiB |
Length
| Max length | 43 |
|---|---|
| Median length | 3 |
| Mean length | 3.5325059 |
| Min length | 1 |
Characters and Unicode
| Total characters | 342102 |
|---|---|
| Distinct characters | 74 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2805 ? |
|---|---|
| Unique (%) | 2.9% |
Sample
| 1st row | S-D |
|---|---|
| 2nd row | S-D |
| 3rd row | S-D |
| 4th row | S-D |
| 5th row | S-D |
| Value | Count | Frequency (%) |
| s-d | 55880 | |
| tm | 7526 | 7.6% |
| tce | 6010 | 6.1% |
| bn | 1906 | 1.9% |
| cvg | 1306 | 1.3% |
| tct | 1182 | 1.2% |
| dispensa | 844 | 0.9% |
| hpafpc | 835 | 0.8% |
| aova | 804 | 0.8% |
| quemaduras | 762 | 0.8% |
| Other values (3215) | 21726 | 22.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 57783 | |
| D | 56974 | |
| - | 55897 | |
| T | 26375 | 7.7% |
| C | 17770 | 5.2% |
| A | 12181 | 3.6% |
| P | 11212 | 3.3% |
| M | 9882 | 2.9% |
| E | 9265 | 2.7% |
| N | 7975 | 2.3% |
| Other values (64) | 76788 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 239529 | |
| Dash Punctuation | 55897 | 16.3% |
| Lowercase Letter | 38116 | 11.1% |
| Math Symbol | 6093 | 1.8% |
| Space Separator | 1937 | 0.6% |
| Other Punctuation | 450 | 0.1% |
| Open Punctuation | 28 | < 0.1% |
| Decimal Number | 27 | < 0.1% |
| Close Punctuation | 25 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 57783 | |
| D | 56974 | |
| T | 26375 | |
| C | 17770 | 7.4% |
| A | 12181 | 5.1% |
| P | 11212 | 4.7% |
| M | 9882 | 4.1% |
| E | 9265 | 3.9% |
| N | 7975 | 3.3% |
| H | 6757 | 2.8% |
| Other values (16) | 23355 |
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 5216 | |
| e | 4795 | |
| a | 3988 | |
| n | 2887 | 7.6% |
| r | 2720 | 7.1% |
| u | 2719 | 7.1% |
| i | 2563 | 6.7% |
| o | 1890 | 5.0% |
| m | 1796 | 4.7% |
| x | 1688 | 4.4% |
| Other values (15) | 7854 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 377 | |
| ? | 23 | 5.1% |
| " | 19 | 4.2% |
| / | 19 | 4.2% |
| ' | 6 | 1.3% |
| , | 3 | 0.7% |
| : | 2 | 0.4% |
| * | 1 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 18 | |
| 4 | 3 | 11.1% |
| 9 | 2 | 7.4% |
| 6 | 1 | 3.7% |
| 1 | 1 | 3.7% |
| 5 | 1 | 3.7% |
| 3 | 1 | 3.7% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 6092 | |
| = | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 15 | |
| [ | 13 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 13 | |
| ) | 12 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 55897 |
Space Separator
| Value | Count | Frequency (%) |
| 1937 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 277645 | |
| Common | 64457 | 18.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 57783 | |
| D | 56974 | |
| T | 26375 | |
| C | 17770 | 6.4% |
| A | 12181 | 4.4% |
| P | 11212 | 4.0% |
| M | 9882 | 3.6% |
| E | 9265 | 3.3% |
| N | 7975 | 2.9% |
| H | 6757 | 2.4% |
| Other values (41) | 61471 |
Common
| Value | Count | Frequency (%) |
| - | 55897 | |
| + | 6092 | 9.5% |
| 1937 | 3.0% | |
| . | 377 | 0.6% |
| ? | 23 | < 0.1% |
| " | 19 | < 0.1% |
| / | 19 | < 0.1% |
| 2 | 18 | < 0.1% |
| ( | 15 | < 0.1% |
| ] | 13 | < 0.1% |
| Other values (13) | 47 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 342102 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 57783 | |
| D | 56974 | |
| - | 55897 | |
| T | 26375 | 7.7% |
| C | 17770 | 5.2% |
| A | 12181 | 3.6% |
| P | 11212 | 3.3% |
| M | 9882 | 2.9% |
| E | 9265 | 2.7% |
| N | 7975 | 2.3% |
| Other values (64) | 76788 |
MISSING 
| Distinct | 108 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 8267 |
| Missing (%) | 8.5% |
| Memory size | 756.7 KiB |
Length
| Max length | 15 |
|---|---|
| Median length | 3 |
| Mean length | 3.1212391 |
| Min length | 1 |
Characters and Unicode
| Total characters | 276470 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | S-D |
|---|---|
| 2nd row | S-D |
| 3rd row | S-D |
| 4th row | S-D |
| 5th row | S-D |
| Value | Count | Frequency (%) |
| s-d | 55895 | |
| tm | 7513 | 8.5% |
| tce | 5951 | 6.7% |
| bn | 1859 | 2.1% |
| cvg | 1291 | 1.5% |
| tct | 1177 | 1.3% |
| hpaf | 954 | 1.1% |
| dispensa | 918 | 1.0% |
| hpafpc | 896 | 1.0% |
| aova | 802 | 0.9% |
| Other values (100) | 11362 | 12.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 59139 | |
| D | 57006 | |
| - | 55895 | |
| T | 22573 | 8.2% |
| C | 13836 | 5.0% |
| A | 11151 | 4.0% |
| E | 9995 | 3.6% |
| M | 9299 | 3.4% |
| P | 8555 | 3.1% |
| N | 4940 | 1.8% |
| Other values (15) | 24081 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 220529 | |
| Dash Punctuation | 55895 | 20.2% |
| Space Separator | 41 | < 0.1% |
| Decimal Number | 5 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 59139 | |
| D | 57006 | |
| T | 22573 | 10.2% |
| C | 13836 | 6.3% |
| A | 11151 | 5.1% |
| E | 9995 | 4.5% |
| M | 9299 | 4.2% |
| P | 8555 | 3.9% |
| N | 4940 | 2.2% |
| H | 4240 | 1.9% |
| Other values (12) | 19795 | 9.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 55895 |
Space Separator
| Value | Count | Frequency (%) |
| 41 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 220529 | |
| Common | 55941 | 20.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 59139 | |
| D | 57006 | |
| T | 22573 | 10.2% |
| C | 13836 | 6.3% |
| A | 11151 | 5.1% |
| E | 9995 | 4.5% |
| M | 9299 | 4.2% |
| P | 8555 | 3.9% |
| N | 4940 | 2.2% |
| H | 4240 | 1.9% |
| Other values (12) | 19795 | 9.0% |
Common
| Value | Count | Frequency (%) |
| - | 55895 | |
| 41 | 0.1% | |
| 2 | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 276470 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 59139 | |
| D | 57006 | |
| - | 55895 | |
| T | 22573 | 8.2% |
| C | 13836 | 5.0% |
| A | 11151 | 4.0% |
| E | 9995 | 3.6% |
| M | 9299 | 3.4% |
| P | 8555 | 3.1% |
| N | 4940 | 1.8% |
| Other values (15) | 24081 |
MISSING 
| Distinct | 95 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 8267 |
| Missing (%) | 8.5% |
| Memory size | 756.7 KiB |
Length
| Max length | 48 |
|---|---|
| Median length | 9 |
| Mean length | 14.288483 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1265631 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | sin datos |
|---|---|
| 2nd row | sin datos |
| 3rd row | sin datos |
| 4th row | sin datos |
| 5th row | sin datos |
| Value | Count | Frequency (%) |
| sin | 55895 | |
| datos | 55895 | |
| traumatismo | 18218 | 9.1% |
| craneo | 9132 | 4.6% |
| multiple | 7513 | 3.8% |
| encefalico | 6049 | 3.0% |
| herida | 3656 | 1.8% |
| torax | 3571 | 1.8% |
| fuego | 3026 | 1.5% |
| arma | 3026 | 1.5% |
| Other values (91) | 34137 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 153991 | |
| s | 145022 | |
| o | 122268 | |
| i | 114315 | |
| t | 111748 | |
| 111541 | ||
| n | 95228 | |
| d | 68814 | 5.4% |
| e | 62608 | 4.9% |
| r | 58260 | 4.6% |
| Other values (15) | 221836 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1154085 | |
| Space Separator | 111541 | 8.8% |
| Decimal Number | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 153991 | |
| s | 145022 | |
| o | 122268 | |
| i | 114315 | |
| t | 111748 | |
| n | 95228 | |
| d | 68814 | |
| e | 62608 | 5.4% |
| r | 58260 | 5.0% |
| m | 57531 | 5.0% |
| Other values (13) | 164300 |
Space Separator
| Value | Count | Frequency (%) |
| 111541 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1154085 | |
| Common | 111546 | 8.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 153991 | |
| s | 145022 | |
| o | 122268 | |
| i | 114315 | |
| t | 111748 | |
| n | 95228 | |
| d | 68814 | |
| e | 62608 | 5.4% |
| r | 58260 | 5.0% |
| m | 57531 | 5.0% |
| Other values (13) | 164300 |
Common
| Value | Count | Frequency (%) |
| 111541 | ||
| 2 | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1265631 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 153991 | |
| s | 145022 | |
| o | 122268 | |
| i | 114315 | |
| t | 111748 | |
| 111541 | ||
| n | 95228 | |
| d | 68814 | 5.4% |
| e | 62608 | 4.9% |
| r | 58260 | 4.6% |
| Other values (15) | 221836 |
Sexo
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 756.7 KiB |
| Masculino | |
|---|---|
| Femenino | |
| S-D | 2502 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.6322643 |
| Min length | 3 |
Characters and Unicode
| Total characters | 835983 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Femenino |
|---|---|
| 2nd row | Femenino |
| 3rd row | Masculino |
| 4th row | Masculino |
| 5th row | Femenino |
Common Values
| Value | Count | Frequency (%) |
| Masculino | 73741 | |
| Femenino | 20601 | 21.3% |
| S-D | 2502 | 2.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| masculino | 73741 | |
| femenino | 20601 | 21.3% |
| s-d | 2502 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 114943 | |
| i | 94342 | |
| o | 94342 | |
| M | 73741 | |
| a | 73741 | |
| s | 73741 | |
| c | 73741 | |
| u | 73741 | |
| l | 73741 | |
| e | 41202 | 4.9% |
| Other values (5) | 48708 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 734135 | |
| Uppercase Letter | 99346 | 11.9% |
| Dash Punctuation | 2502 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 114943 | |
| i | 94342 | |
| o | 94342 | |
| a | 73741 | |
| s | 73741 | |
| c | 73741 | |
| u | 73741 | |
| l | 73741 | |
| e | 41202 | 5.6% |
| m | 20601 | 2.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 73741 | |
| F | 20601 | 20.7% |
| S | 2502 | 2.5% |
| D | 2502 | 2.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2502 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 833481 | |
| Common | 2502 | 0.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 114943 | |
| i | 94342 | |
| o | 94342 | |
| M | 73741 | |
| a | 73741 | |
| s | 73741 | |
| c | 73741 | |
| u | 73741 | |
| l | 73741 | |
| e | 41202 | 4.9% |
| Other values (4) | 46206 |
Common
| Value | Count | Frequency (%) |
| - | 2502 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 835983 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 114943 | |
| i | 94342 | |
| o | 94342 | |
| M | 73741 | |
| a | 73741 | |
| s | 73741 | |
| c | 73741 | |
| u | 73741 | |
| l | 73741 | |
| e | 41202 | 4.9% |
| Other values (5) | 48708 |
Edad_transcrito
Unsupported
REJECTED  UNSUPPORTED 
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 756.7 KiB |
Tipo_restos
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 756.7 KiB |
| Cadáver | |
|---|---|
| Feto | 2844 |
| Miembros | 2065 |
| Recién nacido | 633 |
| Restos óseos | 48 |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 6.9749184 |
| Min length | 4 |
Characters and Unicode
| Total characters | 675479 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Cadáver |
|---|---|
| 2nd row | Cadáver |
| 3rd row | Cadáver |
| 4th row | Cadáver |
| 5th row | Cadáver |
Common Values
| Value | Count | Frequency (%) |
| Cadáver | 91254 | |
| Feto | 2844 | 2.9% |
| Miembros | 2065 | 2.1% |
| Recién nacido | 633 | 0.7% |
| Restos óseos | 48 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| cadáver | 91254 | |
| feto | 2844 | 2.9% |
| miembros | 2065 | 2.1% |
| recién | 633 | 0.6% |
| nacido | 633 | 0.6% |
| restos | 48 | < 0.1% |
| óseos | 48 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 96892 | |
| r | 93319 | |
| d | 91887 | |
| a | 91887 | |
| C | 91254 | |
| á | 91254 | |
| v | 91254 | |
| o | 5638 | 0.8% |
| i | 3331 | 0.5% |
| t | 2892 | 0.4% |
| Other values (11) | 15871 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 577954 | |
| Uppercase Letter | 96844 | 14.3% |
| Space Separator | 681 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 96892 | |
| r | 93319 | |
| d | 91887 | |
| a | 91887 | |
| á | 91254 | |
| v | 91254 | |
| o | 5638 | 1.0% |
| i | 3331 | 0.6% |
| t | 2892 | 0.5% |
| s | 2257 | 0.4% |
| Other values (6) | 7343 | 1.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 91254 | |
| F | 2844 | 2.9% |
| M | 2065 | 2.1% |
| R | 681 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 681 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 674798 | |
| Common | 681 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 96892 | |
| r | 93319 | |
| d | 91887 | |
| a | 91887 | |
| C | 91254 | |
| á | 91254 | |
| v | 91254 | |
| o | 5638 | 0.8% |
| i | 3331 | 0.5% |
| t | 2892 | 0.4% |
| Other values (10) | 15190 | 2.3% |
Common
| Value | Count | Frequency (%) |
| 681 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 583544 | |
| None | 91935 | 13.6% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 96892 | |
| r | 93319 | |
| d | 91887 | |
| a | 91887 | |
| C | 91254 | |
| v | 91254 | |
| o | 5638 | 1.0% |
| i | 3331 | 0.6% |
| t | 2892 | 0.5% |
| F | 2844 | 0.5% |
| Other values (8) | 12346 | 2.1% |
None
| Value | Count | Frequency (%) |
| á | 91254 | |
| é | 633 | 0.7% |
| ó | 48 | 0.1% |
Bitacora_ingresos
Categorical
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 756.7 KiB |
| semefo_df_bo_1981 | |
|---|---|
| semefo_df_bo_1980 | |
| semefo_df_bo_1982 | |
| semefo_df_bo_1979 | |
| semefo_df_bo_1977 | |
| Other values (10) |
Length
| Max length | 17 |
|---|---|
| Median length | 17 |
| Mean length | 17 |
| Min length | 17 |
Characters and Unicode
| Total characters | 1646348 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | semefo_df_bo_1968 |
|---|---|
| 2nd row | semefo_df_bo_1968 |
| 3rd row | semefo_df_bo_1968 |
| 4th row | semefo_df_bo_1968 |
| 5th row | semefo_df_bo_1968 |
Common Values
| Value | Count | Frequency (%) |
| semefo_df_bo_1981 | 7629 | 7.9% |
| semefo_df_bo_1980 | 7619 | 7.9% |
| semefo_df_bo_1982 | 7493 | 7.7% |
| semefo_df_bo_1979 | 7461 | 7.7% |
| semefo_df_bo_1977 | 7166 | 7.4% |
| semefo_df_bo_1978 | 7140 | 7.4% |
| semefo_df_bo_1976 | 6907 | 7.1% |
| semefo_df_bo_1975 | 6861 | 7.1% |
| semefo_df_bo_1973 | 6471 | 6.7% |
| semefo_df_bo_1972 | 5768 | 6.0% |
| Other values (5) | 26329 |
Length
| Value | Count | Frequency (%) |
| semefo_df_bo_1981 | 7629 | 7.9% |
| semefo_df_bo_1980 | 7619 | 7.9% |
| semefo_df_bo_1982 | 7493 | 7.7% |
| semefo_df_bo_1979 | 7461 | 7.7% |
| semefo_df_bo_1977 | 7166 | 7.4% |
| semefo_df_bo_1978 | 7140 | 7.4% |
| semefo_df_bo_1976 | 6907 | 7.1% |
| semefo_df_bo_1975 | 6861 | 7.1% |
| semefo_df_bo_1973 | 6471 | 6.7% |
| semefo_df_bo_1972 | 5768 | 6.0% |
| Other values (5) | 26329 |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 290532 | |
| e | 193688 | |
| f | 193688 | |
| o | 193688 | |
| 1 | 109997 | 6.7% |
| 9 | 109357 | 6.6% |
| b | 96844 | 5.9% |
| s | 96844 | 5.9% |
| d | 96844 | 5.9% |
| m | 96844 | 5.9% |
| Other values (8) | 168022 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 968440 | |
| Decimal Number | 387376 | 23.5% |
| Connector Punctuation | 290532 | 17.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 109997 | |
| 9 | 109357 | |
| 7 | 71498 | |
| 8 | 34600 | 8.9% |
| 6 | 16678 | 4.3% |
| 2 | 13261 | 3.4% |
| 0 | 12922 | 3.3% |
| 5 | 6861 | 1.8% |
| 3 | 6471 | 1.7% |
| 4 | 5731 | 1.5% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 193688 | |
| f | 193688 | |
| o | 193688 | |
| b | 96844 | |
| s | 96844 | |
| d | 96844 | |
| m | 96844 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 290532 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 968440 | |
| Common | 677908 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| _ | 290532 | |
| 1 | 109997 | 16.2% |
| 9 | 109357 | 16.1% |
| 7 | 71498 | 10.5% |
| 8 | 34600 | 5.1% |
| 6 | 16678 | 2.5% |
| 2 | 13261 | 2.0% |
| 0 | 12922 | 1.9% |
| 5 | 6861 | 1.0% |
| 3 | 6471 | 1.0% |
Latin
| Value | Count | Frequency (%) |
| e | 193688 | |
| f | 193688 | |
| o | 193688 | |
| b | 96844 | |
| s | 96844 | |
| d | 96844 | |
| m | 96844 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1646348 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 290532 | |
| e | 193688 | |
| f | 193688 | |
| o | 193688 | |
| 1 | 109997 | 6.7% |
| 9 | 109357 | 6.6% |
| b | 96844 | 5.9% |
| s | 96844 | 5.9% |
| d | 96844 | 5.9% |
| m | 96844 | 5.9% |
| Other values (8) | 168022 |
Pagina_PDF
Real number (ℝ)
| Distinct | 260 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 108.24627 |
| Minimum | 2 |
|---|---|
| Maximum | 261 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 756.7 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 53 |
| median | 104 |
| Q3 | 156 |
| 95-th percentile | 226 |
| Maximum | 261 |
| Range | 259 |
| Interquartile range (IQR) | 103 |
Descriptive statistics
| Standard deviation | 65.798085 |
|---|---|
| Coefficient of variation (CV) | 0.60785543 |
| Kurtosis | -0.86945586 |
| Mean | 108.24627 |
| Median Absolute Deviation (MAD) | 52 |
| Skewness | 0.29364454 |
| Sum | 10483002 |
| Variance | 4329.388 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 31 | 524 | 0.5% |
| 123 | 520 | 0.5% |
| 103 | 519 | 0.5% |
| 63 | 516 | 0.5% |
| 108 | 515 | 0.5% |
| 119 | 515 | 0.5% |
| 28 | 515 | 0.5% |
| 5 | 514 | 0.5% |
| 107 | 514 | 0.5% |
| 61 | 514 | 0.5% |
| Other values (250) | 91678 |
| Value | Count | Frequency (%) |
| 2 | 68 | 0.1% |
| 3 | 443 | |
| 4 | 513 | |
| 5 | 514 | |
| 6 | 513 | |
| 7 | 512 | |
| 8 | 473 | |
| 9 | 486 | |
| 10 | 452 | |
| 11 | 512 |
| Value | Count | Frequency (%) |
| 261 | 4 | < 0.1% |
| 260 | 22 | < 0.1% |
| 259 | 34 | < 0.1% |
| 258 | 36 | < 0.1% |
| 257 | 34 | < 0.1% |
| 256 | 61 | |
| 255 | 76 | |
| 254 | 90 | |
| 253 | 135 | |
| 252 | 114 |
Foja_transcrito
Unsupported
REJECTED  UNSUPPORTED 
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 756.7 KiB |
Observaciones
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 82723 |
|---|---|
| Missing (%) | 85.4% |
| Memory size | 756.7 KiB |
Conocido_desconocido
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 756.7 KiB |
| conocido | |
|---|---|
| desconocido | |
| S-D | 19 |
Length
| Max length | 11 |
|---|---|
| Median length | 8 |
| Mean length | 8.5631841 |
| Min length | 3 |
Characters and Unicode
| Total characters | 829293 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | conocido |
|---|---|
| 2nd row | conocido |
| 3rd row | conocido |
| 4th row | conocido |
| 5th row | conocido |
Common Values
| Value | Count | Frequency (%) |
| conocido | 78613 | |
| desconocido | 18212 | 18.8% |
| S-D | 19 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| conocido | 78613 | |
| desconocido | 18212 | 18.8% |
| s-d | 19 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 290475 | |
| c | 193650 | |
| d | 115037 | 13.9% |
| n | 96825 | 11.7% |
| i | 96825 | 11.7% |
| e | 18212 | 2.2% |
| s | 18212 | 2.2% |
| S | 19 | < 0.1% |
| - | 19 | < 0.1% |
| D | 19 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 829236 | |
| Uppercase Letter | 38 | < 0.1% |
| Dash Punctuation | 19 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 290475 | |
| c | 193650 | |
| d | 115037 | 13.9% |
| n | 96825 | 11.7% |
| i | 96825 | 11.7% |
| e | 18212 | 2.2% |
| s | 18212 | 2.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 19 | |
| D | 19 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 829274 | |
| Common | 19 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 290475 | |
| c | 193650 | |
| d | 115037 | 13.9% |
| n | 96825 | 11.7% |
| i | 96825 | 11.7% |
| e | 18212 | 2.2% |
| s | 18212 | 2.2% |
| S | 19 | < 0.1% |
| D | 19 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| - | 19 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 829293 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 290475 | |
| c | 193650 | |
| d | 115037 | 13.9% |
| n | 96825 | 11.7% |
| i | 96825 | 11.7% |
| e | 18212 | 2.2% |
| s | 18212 | 2.2% |
| S | 19 | < 0.1% |
| - | 19 | < 0.1% |
| D | 19 | < 0.1% |
| Bitacora_ingresos | Conocido_desconocido | Pagina_PDF | Procedencia_alcaldia | Sexo | Tipo_restos | |
|---|---|---|---|---|---|---|
| Bitacora_ingresos | 1.000 | 0.047 | 0.217 | 0.073 | 0.101 | 0.042 |
| Conocido_desconocido | 0.047 | 1.000 | 0.303 | 0.172 | 0.193 | 0.279 |
| Pagina_PDF | 0.217 | 0.303 | 1.000 | 0.033 | 0.130 | 0.287 |
| Procedencia_alcaldia | 0.073 | 0.172 | 0.033 | 1.000 | 0.051 | 0.059 |
| Sexo | 0.101 | 0.193 | 0.130 | 0.051 | 1.000 | 0.460 |
| Tipo_restos | 0.042 | 0.279 | 0.287 | 0.059 | 0.460 | 1.000 |
| ID | Numero_progresivo_transcrito | Nombre_completo_transcrito | Primer_apellido | Segundo_apellido | Nombres_propios | Fecha_transcrito | Fecha_estandar | Expediente_SEMEFO_transcrito | Procedencia_transcrito | Procedencia_estandar | Procedencia_direccion | Procedencia_alcaldia | Numero_acta_transcrito | Procedencia_acta | Diagnostico_transcrito | Diagnostico_estandar | Diagnostico_extendido | Sexo | Edad_transcrito | Tipo_restos | Bitacora_ingresos | Pagina_PDF | Foja_transcrito | Observaciones | Conocido_desconocido | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | BO_1968_00001 | S-D | acosta ortega teresa | acosta | ortega | teresa | 1968-01-03 00:00:00 | 1968-01-03 | 37 | S-D | S-D | Sin datos | NaN | S-D | NaN | S-D | S-D | sin datos | Femenino | S-D | Cadáver | semefo_df_bo_1968 | 2 | 1 | NaN | conocido |
| 1 | BO_1968_00002 | S-D | avila de cuestas catalina | avila | de cuestas | catalina | 1968-01-05 00:00:00 | 1968-01-05 | 58 | S-D | S-D | Sin datos | NaN | S-D | NaN | S-D | S-D | sin datos | Femenino | S-D | Cadáver | semefo_df_bo_1968 | 2 | 1 | NaN | conocido |
| 2 | BO_1968_00003 | S-D | arzate paredes juan | arzate | paredes | juan | 1968-01-07 00:00:00 | 1968-01-07 | 83 | S-D | S-D | Sin datos | NaN | S-D | NaN | S-D | S-D | sin datos | Masculino | S-D | Cadáver | semefo_df_bo_1968 | 2 | 1 | NaN | conocido |
| 3 | BO_1968_00004 | S-D | alvarez martinez isaac | alvarez | martinez | isaac | 1968-01-07 00:00:00 | 1968-01-07 | 86 | S-D | S-D | Sin datos | NaN | S-D | NaN | S-D | S-D | sin datos | Masculino | S-D | Cadáver | semefo_df_bo_1968 | 2 | 1 | NaN | conocido |
| 4 | BO_1968_00005 | S-D | arellano viuda de campos ma. | arellano | viuda de campos | ma. | 1968-01-07 00:00:00 | 1968-01-07 | 88 | S-D | S-D | Sin datos | NaN | S-D | NaN | S-D | S-D | sin datos | Femenino | S-D | Cadáver | semefo_df_bo_1968 | 2 | 1 | NaN | conocido |
| 5 | BO_1968_00006 | S-D | arce macedo justo | arce | macedo | justo | 1968-01-09 00:00:00 | 1968-01-09 | 115 | S-D | S-D | Sin datos | NaN | S-D | NaN | S-D | S-D | sin datos | Masculino | S-D | Cadáver | semefo_df_bo_1968 | 2 | 1 | NaN | conocido |
| 6 | BO_1968_00007 | S-D | alvarez vela jesus | alvarez | vela | jesus | 1968-01-02 00:00:00 | 1968-01-02 | 22 | S-D | S-D | Sin datos | NaN | S-D | NaN | S-D | S-D | sin datos | Masculino | S-D | Cadáver | semefo_df_bo_1968 | 2 | 1 | NaN | conocido |
| 7 | BO_1968_00008 | S-D | avila ramirez pablo | avila | ramirez | pablo | 1968-01-10 00:00:00 | 1968-01-10 | 137 | S-D | S-D | Sin datos | NaN | S-D | NaN | S-D | S-D | sin datos | Masculino | S-D | Cadáver | semefo_df_bo_1968 | 2 | 1 | NaN | conocido |
| 8 | BO_1968_00009 | S-D | alvarado aurelio | alvarado | s-d | aurelio | 1968-01-10 00:00:00 | 1968-01-10 | 139 | S-D | S-D | Sin datos | NaN | S-D | NaN | S-D | S-D | sin datos | Masculino | S-D | Cadáver | semefo_df_bo_1968 | 2 | 1 | NaN | conocido |
| 9 | BO_1968_00010 | S-D | alvarez almaguer arturo | alvarez | almaguer | arturo | 1968-01-10 00:00:00 | 1968-01-10 | 132 | S-D | S-D | Sin datos | NaN | S-D | NaN | S-D | S-D | sin datos | Masculino | S-D | Cadáver | semefo_df_bo_1968 | 2 | 1 | NaN | conocido |
| ID | Numero_progresivo_transcrito | Nombre_completo_transcrito | Primer_apellido | Segundo_apellido | Nombres_propios | Fecha_transcrito | Fecha_estandar | Expediente_SEMEFO_transcrito | Procedencia_transcrito | Procedencia_estandar | Procedencia_direccion | Procedencia_alcaldia | Numero_acta_transcrito | Procedencia_acta | Diagnostico_transcrito | Diagnostico_estandar | Diagnostico_extendido | Sexo | Edad_transcrito | Tipo_restos | Bitacora_ingresos | Pagina_PDF | Foja_transcrito | Observaciones | Conocido_desconocido | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 96834 | BO_1982_07484 | S-D | Nombre de particular que podría encontrarse con vida. Se clasifica como confidencial con fundamento en el artículo 116 de la LGTAIP. | NaN | s-d | NaN | 1982-10-27 00:00:00 | 1982-10-27 | 6040 | 32a | 32A | Ministerio Público 32 | NaN | 1423 | 32a -- 1423 | S-D | S-D | sin datos | Masculino | S-D | Miembros | semefo_df_bo_1982 | 250 | 155 | punto rojo en expediente_semefo y raya roja en procedencia. no copias rl | conocido |
| 96835 | BO_1982_07485 | S-D | Nombre de particular que podría encontrarse con vida. Se clasifica como confidencial con fundamento en el artículo 116 de la LGTAIP. | NaN | s-d | NaN | 1982-10-27 00:00:00 | 1982-10-27 | 6041 | 34a | 34A | Ministerio Público 34 (Col Santo Tomás) | Miguel Hidalgo | 1454 | 34a -- 1454 | S-D | S-D | sin datos | Masculino | S-D | Miembros | semefo_df_bo_1982 | 250 | 155 | punto rojo en expediente_semefo y raya roja en procedencia | conocido |
| 96836 | BO_1982_07486 | S-D | Nombre de particular que podría encontrarse con vida. Se clasifica como confidencial con fundamento en el artículo 116 de la LGTAIP. | NaN | s-d | NaN | 1982-03-11 00:00:00 | 1982-03-11 | 1453 | 37a | 37A | Ministerio Público 37 (Col Polanco) | Miguel Hidalgo | 224 | 37a -- 224 | S-D | S-D | sin datos | Masculino | S-D | Miembros | semefo_df_bo_1982 | 251 | 156 | NaN | conocido |
| 96837 | BO_1982_07487 | S-D | restos placentarios | s-d | s-d | s-d | 1982-06-10 00:00:00 | 1982-06-10 | 3258 | 8a | 8A | Ministerio Público 8 (Col Narvarte) | Benito Juárez | 1803 | 8a -- 1803 | S-D | S-D | sin datos | S-D | S-D | Miembros | semefo_df_bo_1982 | 251 | 156 | NaN | desconocido |
| 96838 | BO_1982_07488 | S-D | restos humanos de desconocido | s-d | s-d | s-d | 1982-05-03 00:00:00 | 1982-05-03 | 1100 | 23a | 23A | Ministerio Público 23 (Col La Joya Tlalpan) | Tlalpan | 405 | 23a -- 405 | S-D | S-D | sin datos | S-D | S-D | Miembros | semefo_df_bo_1982 | 251 | 156 | NaN | desconocido |
| 96839 | BO_1982_07489 | S-D | placenta | s-d | s-d | s-d | 1982-06-05 00:00:00 | 1982-06-05 | 3079 | 15a | 15A | Ministerio Público 15 (Col Aragón La Villa) | Gustavo A. Madero | 960 | 15a -- 960 | S-D | S-D | sin datos | S-D | S-D | Miembros | semefo_df_bo_1982 | 251 | 156 | NaN | desconocido |
| 96840 | BO_1982_07490 | S-D | 5 dedos del pie derecho de desconocido | s-d | s-d | s-d | 1982-06-05 00:00:00 | 1982-06-05 | 3060 | 32a | 32A | Ministerio Público 32 | NaN | 950 | 32a -- 950 | S-D | S-D | sin datos | S-D | S-D | Miembros | semefo_df_bo_1982 | 251 | 156 | NaN | desconocido |
| 96841 | BO_1982_07491 | S-D | dedo de desconocido | s-d | s-d | s-d | 1982-11-19 00:00:00 | 1982-11-19 | 6389 | 32a | 32A | Ministerio Público 32 | NaN | 2005 | 32a -- 2005 | S-D | S-D | sin datos | S-D | S-D | Miembros | semefo_df_bo_1982 | 251 | 156 | NaN | desconocido |
| 96842 | BO_1982_07492 | S-D | 4 dedos de desconocido | s-d | s-d | s-d | 1982-11-28 00:00:00 | 1982-11-28 | 6528 | 27a | 27A | Ministerio Público 27 (San Pedro) | Xochimilco | 959 | 27a -- 959 | S-D | S-D | sin datos | S-D | S-D | Miembros | semefo_df_bo_1982 | 251 | 156 | NaN | desconocido |
| 96843 | BO_1982_07493 | S-D | osamenta de desconocido | s-d | s-d | s-d | 1982-10-11 00:00:00 | 1982-10-11 | 5629 | 9a | 9A | Ministerio Público 9 (Col Tacuba) | Miguel Hidalgo | 4193 | 9a -- 4193 | S-D | S-D | sin datos | S-D | S-D | Cadáver | semefo_df_bo_1982 | 251 | 156 | no se recibio necropsia. es una osamenta. | desconocido |